Adjusting the Frame: Biphasic Performative Control of Speech Rhythm

نویسندگان

  • Samuel Delalez
  • Christophe d'Alessandro
چکیده

Performative time and pitch scaling is a new research paradigm for prosodic analysis by synthesis. In this paper, a system for real-time recorded speech time and pitch scaling by the means of hands or feet gestures is designed and evaluated. Pitch is controlled with the preferred hand, using a stylus on a graphic tablet. Time is controlled using rhythmic frames, or constriction gestures, defined by pairs of control points. The ”Arsis” corresponds to the constriction (weak beat of the syllable) and the ”Thesis” corresponds to the vocalic nucleus (strong beat of the syllable). This biphasic control of rhythmic units is performed by the non-preferred hand using a button. Pitch and time scales are modified according to these gestural controls with the help of a real-time pitch synchronous overlap-add technique (RT-PSOLA). Rhythm and pitch control accuracy are assessed in a prosodic imitation experiment: the task is to reproduce intonation and rhythm of various sentences. The results show that inter-vocalic durations differ on average of only 20 ms. The system appears as a new and effective tool for performative speech and singing synthesis. Consequences and applications in speech prosody research are discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Vokinesis: syllabic control points for performative singing synthesis

Performative control of voice is the process of real-time speech synthesis or modification by the means of hands or feet gestures. Vokinesis, a system for real-time rhythm and pitch modification and control of singing is presented. Pitch and vocal effort are controlled by a stylus on a graphic tablet. The concept of Syllabic Control Points (SCP) is introduced for timing and rhythm control. A ch...

متن کامل

Creating an individual speech rhythm: a data driven approach

Generating a near-to-natural speech rhythm can greatly contribute to the user's acceptance of TTS systems. Beside common aspects of the rhythm control (correctness of the segmental durations, robust function, etc.) rhythmic flexibility for several applications and individual speaking styles are desired. This article describes a data driven concept, which aims at the generation of an individual ...

متن کامل

MAGE 2.0: New Features and its Application in the Development of a Talking Guitar

This paper describes the recent progress in our approach to generate performative and controllable speech. The goal of the performative HMM-based speech and singing synthesis library, called Mage, is to have the ability to generate natural sounding speech with arbitrary speaker’s voice characteristics, speaking styles and expressions and at the same time to have accurate reactive user control o...

متن کامل

Performative faces

The paper presents a model for the construction of an artificial agent that can express performatives through facial expression. The performative of a speech act or communicative act is the particular communicative intention a Sender has to one's Addressee, the way one wants to socially relate oneself to the interlocutor. Performatives are decomposed both on the meaning and on the signal side: ...

متن کامل

Hereby explained: an event-based account of performative utterances

Several authors propose that performative speech acts are self-guaranteeing due to their self-referential nature (Searle 1989; Jary 2007). The present paper offers an analysis of self-referentiality in terms of truth conditional semantics, making use of Davidsonian events. I propose that hereby can denote the ongoing act of information transfer (more mundanely, the utterance) which thereby ente...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017